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ABSTRACT 

Living organisms change their proteome dramatically 
to sustain a stable internal milieu in fluctuating envir- 
onments. To study the dynamics of proteins during 
stress, we measured the localization and abundance 
of the Saccharomyces cerevisiae proteome under 
various growth conditions and genetic backgrounds 
using the GFP collection. We created a database (DB) 
called 'LoQAtE' (Localizaiton and Quantitation Atlas 
of the yeast proteomE), available online at http:// 
www.weizmann.ac.il/molgen/loqate/, to provide 
easy access to these data. Using LoQAtE DB, users 
can get a profile of changes for proteins of interest as 
well as querying advanced intersections by either 
abundance changes, primary localization or localiza- 
tion shifts over the tested conditions. Currently, the 
DB hosts information on 5330 yeast proteins under 
three external perturbations (DTT, H2O2 and nitrogen 
starvation) and two genetic mutations [in the chap- 
eronin containing TCP1 (CCT) complex and in the 
proteasome]. Additional conditions will be uploaded 
regularly. The data demonstrate hundreds of local- 
ization and abundance changes, many of which 
were not detected at the level of mRNA. LoQAtE is 
designed to allow easy navigation for non-experts in 
high-content microscopy and data are available for 
download. These data should open up new perspec- 
tives on the significant role of proteins while combat- 
ing external and internal fluctuations. 

INTRODUCTION 

The budding yeast Saccharomyces cerevisiae robustly 
adapts to a variety of fluctuations such as extreme environ- 
ments, genetic mutations and life phases. Many cellular 



components within the cell (such as DNA, RNA, proteins 
and lipids) rearrange dynamically to promote survival 
(1-17). However, although vast amounts of data have 
been collected on chromatin modifications and transcrip- 
tional responses under changing conditions, only a handful 
of pioneering studies describe the dynamics of proteins 
(2,9-11). To enable a broader view of the proteomic 
response under varying conditions, we constructed an auto- 
mated microscopy setup that allows reproducible, accurate 
and sensitive measurements of the localization and abun- 
dance of fluorescently tagged proteins at single-cell reso- 
lution. Using this setup, we visualized the yeast Green 
Fluorescence Protein (GFP) library in which nearly 5500 
yeast proteins are tagged with a C terminal GFP under 
their natural promoter (18). Visualizing all strains under 
various internal and external fluctuations, we found >100 
proteins that could be detected for the first time and 
detemiined their subcellular localization (4). We then 
tracked proteome-level changes using two methods: first, 
we classified proteins into 13 localization categories and 
detected hundreds of proteins that shift between different 
cellular locals under stress (4,19). We then computed abun- 
dance based on the fluorescence intensity of the GFP signal 
and found all proteins that change their abundance (4,19). 
Comparison with transcriptome data (14) demonstrated 
that up to 60% of proteomic changes could not be pre- 
dicted from the dynamics of their transcripts, therefore, 
emphasizing the importance of studying proteins directly 
(20,21). To easily browse and download all aspects of 
these data, we built the Localization and Quantitation 
ATlas of the yeast proteomE (LoQAtE) database (DB). 
The current article presents how 'LoQAtE' can be easily 
used for uncovering new biological phenomena. 

DATABASE DESIGN AND IMPLEMENTATION 

The DB is constructed such that both browsing for a 
protein of interest and advanced intersections of various 
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parameters can be performed easily. LoQAtE holds infor- 
mation on 328 localization shifts of proteins from one 
organelle to another and 1400 abundance change events. 
Each change may contribute directly to the ability of the 
cell to respond to the perturbation studied, or it can serve 
as a marker for cellular changes if it is a downstream effect 
of the stress. Using LoQAtE, such changes can be 
uncovered and studied in several methods: 

Studying a specific protein 
Quick search 

The most simple utility of the DB is to insert a single name 
or a tab-delimited list of open reading frames (ORFs)/gene 
names (for an example of the results screen see Figure lA) 
and receive a 'quick search' result. This result presents the 
localization and abundance of all proteins of interest 
under all conditions tested as well as the 'fold change' of 
protein abundance in each condition relative to a control 
basehne condition (synthetic defined medium) and a stat- 
istical assessment of the significance of this change. Full 
results including the microscopic images are presented for 
up to 100 proteins (to enable rapid display of the search 
results). At any moment, the full hst of results can be 
downloaded either as a csv file summarizing the data or 
as a folder of the raw images. For ease of utihty the default 
channel shown in the results screen is the GFP channel; 
however, the bright field image is available in a clickable 
fashion. 

Studying a group of proteins 

The abihty to search by more advanced criteria enables 
users to uncover patterns in protein behavior that may not 
have been revealed by examining a single protein. For 
example, categorizing groups of proteins that shift their 
localization from the same origin to the same target or- 
ganelle may allow a unique view on the functional changes 
in each organelle, provide a basis for analysis of the 
factors mediating these movements or may uncover a 
joint function. In another example, regulatory elements 
may be found for groups of proteins that are down/ 
upregulated concordantly. We have, therefore, built 
several interfaces to enable advanced queries of the data 
and to enable such grouping. 

Advanced search 

1. Search by localization. It allows the user to choose all 
proteins localized to a specific organelle/s in a defined 
condition. In each query, one condition can be chosen 
and as many as 13 localization categories (or 'below 
threshold' that represents no expression) can be picked. 
Results are presented and can be downloaded as described 
earlier in text. 

2. Search by abundance. It allows the user to choose a set 
of proteins that changed abundance concordantly for a 
specific condition or for several conditions in an 'AND' 
relation. To use this option, one or more growth condi- 
tions should be chosen, and for each, the type of abun- 
dance change (upregulated, no change or downregulated) 
should be defined. Additionally, the user can filter results 



to a specific organelle. Results are presented and can be 
downloaded as described earlier in text. 

3. Search by movement. A schematic diagram depicting 
the types and numbers of changes in cellular localization 
observed during yeast growth in the three environmental 
stresses is available. In a clickable fashion, the user can 
choose one arrow (representative of one type of localiza- 
tion shift in a specific condition) and chck the search 
button. Results provide the entire hst of proteins classified 
as undergoing this shift and can be downloaded as 
described earher in text (Figure IB). 

Using tfie advanced search to study transcriptional 
networks 

Using the data in LoQAtE provides the opportunity to 
uncover important layers of cellular function that could 
not have been uncovered by mRNA levels alone. 
Uncovering the extent of post-transcriptional regulation 
can be easily done by comparing all changes observed 
under a specific condition with the transcriptional 
response under this condition (14). These types of data 
may potentially be further characterized to define 
mRNA motifs or secondary structures affecting transla- 
tion efficiency or mRNA stability as well as protein level 
signals such as stress-dependent degrons or binding motifs 
that affect protein abundance in the absence of 
transcription. 

To demonstrate one such utility, we have used the 
'Search by Abundance' and 'Search by Movement' to 
retrieve the entire group of transcription factors (TFs) 
that change abundance or localization during nitrogen 
starvation out of a hst of all predicted and known TFs 
in yeast (courtesy of Professor Eran Segal) (see Figure 2A 
and B). Interestingly, comparing these 33 proteins with 
their mRNA levels under the exact same conditions un- 
covered that the transcripts of 48% of them had no de- 
tectable change (14). To uncover the potential affect of 
these post-transcriptional changes on adaptation to star- 
vation we used YEASTRACT (22-24) to predict their 
potential targets. Then, we calculated the percentage of 
abundance changes discovered in LoQAtE that can be 
explained by these TFs (see Figure 2C). Intriguingly, an 
astounding fraction of changes in protein levels measured 
under nitrogen starvation could be explained by the post- 
transcriptional changes of this handful of TFs. As these 
conditions have been extensively studied from the tran- 
scriptional perspective (14) without the post-transcrip- 
tional knowledge, this means that the conclusions 
reached might have been incomplete. 

SIMILAR DATABASES 

LoQAtE DB is the first ever resource that particularly 
concentrates on the dynamic characteristics of the yeast 
proteome. The growing interest in directly measuring 
proteomic features has brought about several additional 
DBs presenting systematic proteomic data. The pioneering 
yeast localization DB (http://yeastgfp.yeastgenome.org/) 
provides information about the localization and 
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Figure 1. Screen shots of two result windows in a LoQAtE search. The LoQAtE allows various search functions to be performed. Both quick search 
and advanced search (Search by abundance or localization) (A) enables querying for large groups of proteins and results in the entire localization and 
abundance data for each protein being presented. In addition, the results of each search can be downloaded in an excel table or as raw images. 
'Search by movement' (B) allows the user to choose a movement of interest represented as an arrow on a schematic representation of the cell and 
receive information on all proteins that are assigned to this category. 
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Figure 2. Application of LoQAtE to study regulatory networlcs. Two advanced search options were used to put-together a hst of all yeast TFs that 
changed abundance (search by abundance) or localization (search by movement). (A) Example of change in localization during nitrogen starvation. 
Scale bar represents 5 ]x. (B) The hst of 33 TFs was compared with microarray data on expression changes under the same stress condition (14) to 
uncover that 16 of them were not documented to have any change in their transcripts under the exact same experimental conditions. (C) Using the 
DB for regulatory motifs (YEASTRACT), we extracted the targets for these 16 TFs and found that an enormous fraction of all changes in protein 
levels found in LoQAtE during nitrogen starvation (using the Search by Abundance option) (22-24) can be explained by 10 of the TFs. 



abundance of the majority of yeast proteins under 
standard laboratory growth conditions based on the con- 
struction of the GFP collection (18). Additionally, re- 
analysis of the GFP collection and additional proteins 
involved in lipid metabolism was performed by high- 
resolution confocal imaging (25) and is available at 
the YPL + DB (http://yplp.uni-graz.at/index.php) (26,27). 
For a broader range of proteomic data, the Yeast 
Resource Center (YRC) (http://depts.washington.edu/ 
yeastrc/) is a collaborative web site that gathers several 
core technologies and provides a breadth of information 
such as quantitative mass-spectrometry, sequence- 
function relationships, microscopy, structure and compu- 
tational strategies for structure-function predictions. 
Finally, organelle DB (http://organelledb.lsi.umich.edu/ 
index. php) compiles protein localization data that are 
organized into >50 organelles, subcellular structures and 
protein complexes (28,29). The data set includes 138 or- 
ganisms spanning the eukaryotic kingdom and incorpor- 
ates ongoing results from large-scale studies of protein 
localization in yeast S. cerevisiae. 

A similar effort in human cell lines is based on the 
library of annotated reporter cell-clones (LARC) in 
which each strain (out of 2180) contains an endogenous 
protein fused to yellow fluorescent protein (YFP) (30). 
This DB (http://www.weizmann.ac.il/mcb/UriAlon/Dy 
namProt/) provides the expression and localization of 
each protein over time in response to various drugs and 
is regularly updated with newly created clones (30). A 
more comprehensive human proteome atlas has recently 
become available and harbors expression data for thou- 
sands of human proteins from a large number of healthy 
tissues, tumors, cell lines and subcellular localizations, all 
based on antibody staining (http://www.proteinatlas.org/) 
(31). A principally similar DB called 'PeptideAtlas' 
(http://www.peptideatlas.org/) harbor growing data on 
proteins expressed under various conditions for a wide 



variety of organisms from yeast to human, collected by 
tandem mass spectrometry (MS/MS) (32). 

OUTLOOK 

The LoQAtE DB, in its first version, contains easily ac- 
cessible and user-friendly information about subcellular 
localization and levels for 5330 yeast proteins under 
three environmental stress conditions and two genetic per- 
turbations. The richness of post-translational events docu- 
mented in this DB undedines the significance of studying 
the level of proteins to capture a true picture of a cell's 
response to stress. Starting from behavior of individual 
proteins and up to a bird's eye view on the entire ceU as 
a hoHstic functional unit, this DB enables scientists to gain 
a new insight on cellular dynamics. As more screens are 
performed in our laboratory, they wiU be added into this 
publicly available atlas. This rich resource should serve for 
multiparametric searches and discovery of new protein 
functions illuminated only by the combination of such 
vast amounts of data. 
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